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Abstract 

This paper describes a method for com- 
piling a constraint-based grammar into 
a potentially more efficient form for pro- 
cessing. This method takes dependent 
disjunctions within a constraint formula 
and factors them into non-interacting 
groups whenever possible by determining 
their independence. When a group of de- 
pendent disjunctions is split into smaller 
groups, an exponential amount of redun- 
dant information is reduced. At runtime, 
this means that an exponential amount 
of processing can be saved as well. Since 
the performance of an algorithm for pro- 
cessing constraints with dependent dis- 
junctions is highly determined by its in- 
put, the transformation presented in this 
paper should prove beneficial for all such 
algorithms. 

1 Introduction 



incrc arc two tacts that conspire to make tile 



exponential-time problem as soon as disjunctions 
are included]]] Since disjunction is unlikely to dis- 
appear from natural language grammars, control- 
ling its form can save exponential amounts of time. 

This paper introduces an efficient normal form 
for processing dependent disjunctive constraints 
and an operation for compilation into this normal 
form. This operation, modularization, can reduce 
exponential amounts of redundant information in 
a grammar and can consequently save correspond- 
ing amounts of processing time. While this oper- 
ation is general enough to be applied to a wide 
variety of constraint systems, it was originally de- 
signed to optimize processing of dependent dis- 
junctions in feature structure-based grammars. In 
particular, modular feature structures are more 
efficient for unification than non-modular ones. 
Since in many current NLP systems, a signifi- 
cant amount of time is spent performing unifica- 
tion, optimizing feature structures for unification 
should increase the performance of these systems. 

Many algorithms for efficient unification of fea- 
ture structures with dependent disjunctions have 
been proposed (Maxwell and Kaplan, 1989|; Eisclc 



treatment ot disjunction an important consider- 



ation when building a natural language proccss- 
ing (NLP) system. The first fact is that nat- 
ural languages are full of ambiguities, and in 
a grammar many of these ambiguities are de- 
scribed by disjunctions. The second fact is that 
the introduction of disjunction into a grammar 
causes processing time to increase exponentially 
in the number of disjuncts. This means that a 
nearly linear-time operation, such as unification of 
purely conjunctive feature structures, becomes an 
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and Dorre, 1990 ; Gerdemann, 1991 ; [gtromback 
I992| ; [Griffith, I99e| ). However, all of these al- 
gorithms suffer from a common problem: their 
performance is highly determined by their inputs. 
All of these algorithms will perform at their best 
when their dependent disjunctions interact as lit- 
tle as possible, but if all of the disjunctions inter- 
act, then these algorithms may perform redundant 
computations. The need for efficient inputs has 
been noted in the literature^] but there have been 
few attempts to automatically optimize grammars 
for disjunctive unification algorithms. 

The modularization algorithm presented in this 
paper takes existing dependent disjunctions and 
splits them into independent groups by determin- 



1 Assu ming P £ NP. 

2 Cf. (Maxwell and Kaplan, 1991) for instance. 



ing which disjunctions really interact. Indepen- 
dent groups of disjunctions can be processed sepa- 
rately during unification rather than having to try 
every combination of one group with every com- 
bination of every other group. 

This paper is organized as follows: Section || 
gives an informal introduction to dependent dis- 
junctions and shows how redundant interactions 
between groups of disjunctions can be reduced. 
Section [| shows how normal disjunctions can be 
replaced by contexted constraints. Section || then 
shows how these contexted constraints can en- 
code dependent disjunctions. Section || presents 
the modularization algorithm for contexted con- 
straints. However, even though this algorithm is 



(41 <j>, (/)') A (d" if), ip'). This is the process of mod- 
ularization which will be formalized in section 

One might be tempted to think that modular- 
ization is unnecessary since grammar writers are 
unlikely to write dependent disjunctions which 
contain independent parts. However, grammar 
writers may not be the only source of dependent 
disjunctions. Many grammar processing systems 
use high-level descriptions which are then trans- 
formed into more explicit lower- level grammars. 
This transformation process may very well in- 
troduce large numbers of dependent disjunctions 
with exactly this property. 

One example of where this can happen is in 



the compilation of lexical rules in (Mcurers and 



a compile-time operation, it itself has exponential Minncn, 1995). In this paper, Meurers and Min 



complexity, so making it more efficient should also 
be a concern. A theorem will then be presented 
in section ^| that permits an exponential part of 
the modularization algorithm to be replaced by 
combinatorial analysis. 

2 Dependent disjunctions 

Dependent disjunctions are like normal disjunc- 
tions except that every disjunction has a name, 
and the disjuncts of disjunctions with the same 
name must be chosen in sync. For example, 
(d 4>, <j>' , 4>") A (d tp, ip', "0") is a conjunction of two 
dependent disjunctions with the same name, d. 
What this means is that if the second disjunct in 
the first disjunction, <f>' , is chosen, then the sec- 
ond disjunct of the other disjunction, ip', must 
be chosen as well. (Note that what kind of con- 
straints the cps and ips are is not important here.) 
The computational reason for using dependent 
disjunctions over normal disjunctions is that de- 
pendent disjunctions allow for more compact and 
efficient structures. This is particularly true when 
dependent disjunctions are embedded inside of 
feature structures. This is because in that case 
disjunctions can be kept local in a directed graph 
structure thus saving redundant feature paths. 

We say that disjunctions with the same name 
are in the same group. One distinguishing fea- 
ture of a group of disjunctions is that each dis- 
junction must have the same number of dis- 
juncts. This is essentially where redundant inter- 
actions originate. For instance, in (d <f>, (p, <f>' , 4>') A 
{d V' i ip' ^ V' i ip') each disjunction has four disjuncts , 



nen describe a compiler which translates a set of 
HPSG lexical rules and their interaction into def- 
inite relations used to constrain lexical entries. 
In ( Meurers and Minncn, 1996) ) , they show how 
an off-line compilation technique called constraint 
propagation can be used to improve the definite 
clause encoding produced by their compiler to al- 
low for more efficient processing. The use of de- 
pendent disjunctions provides an attractive alter- 
native to the constraint propagation approach by 
specifying all the information associated with a 
lexical entry directly as a single dependent fea- 
ture structure rather than hidden in a set of defi- 
nite clauses.Fl Consider the AVM below: 



^ licbcn. licbcn, licbt, licbt^. 
^ bsc, bsc, fin. fin^ 



VFORM 1 bsc, bsc, fi 
Id 

SUBJ [7] 
COMPS < { 



VFORM bsc 
CONT I 2 I 



),(),(),( 



VFORM bsc 
CONT I a I 



CONT 



AR.G1 
AR.G2 



SLASH 1 ( ), < 



VFORM bsc 
CONT [T] 



>. ( 



VFORM bsc 
CONT IT] 



>, ( > 



This complex lexical entry represents the base lex- 
ical entry for the German verb lieben, "to love", 
and the three lexical entries that can be derived 



but really only two values , — But more impor- ers and Minncn, 199(: ) 



from it given the lexical rules presented in ( Mcur- 

The differences between 



tantly, no matter what value of the first disjunc- 
tion is chosen (</> or <j>') the same values are pos- 
sible for the second (ip or ip'). In other words, 
these disjunctions are actually independent from 
one another, and can be put into different groups: 



these lexical entries are encoded by the dependent 
disjunctions all of which are in the same group, 
d. The first disjunct in each disjunction corre- 



In the case of infinite lexica, definite clauses are 
still necessary to encode recursive information. 



sponds to the base form, the second corresponds 
to the application of the Complement Extraction 
Lexical Rule, the third corresponds to the appli- 
cation of the Finitivization Lexical Rule, and the 
last corresponds to the application of both rulesj^j 
Modularization can be used to make this feature 
structure even more efficient by splitting the group 
d into two new groups d\ and c?2 as shown below. 



^ licbcn, licbtl, 

J bse, fin K 
Idl J 



COMPS I < 
d 2 



VFORM bsc 
CONT IT] 



>, ( > 



ARGl 
ARG2 

( >, ( 



VFORM bsc 

cont rrj 



Another example of where modularization 
might prove useful is in the treatment of typed 
feature structures presented in (Gcrdcmann and 
King, 1994). Their approach produces a set of 



feature structures from a satisfiability algorithm 
such that all of the feature structures have the 
same shape but the nodes may be labeled by dif- 
ferent types. They then collapse this set down to 
a single feature structure where nodes are labeled 
with dependent disjunctions of types. Many of the 
groups of disjunctions in their feature structures 
can be made more efficient via modularization. 

A final example is in the compaction algo- 
rithm for feature structures, presented in ( priffith 



1995| ). Compaction is another operation designed 
to optimize feature structures for unification. It 
takes a disjunction of feature structures, trans- 
forms them into a single feature structure with 
dependent disjunctions, and then pushes the dis- 
junctions down in the structure as far as possible. 
The result is a large number of dependent dis- 
junctions in the same group. Many of these can 
probably be split into new independent groups. 

3 Contexted constraints 



Maxwell and Kaplan (1989[ ) showed how a dis 



junction of constraints could be replaced by 
an equi-satisfiable conjunction of contexted con- 



straints as in lemma [l] below J^] 
Lemma 1 (Contexted Constraints) 

01 V 4>2 is satisfiable iff (p — i <f>i) A (p — > fo) is 
satisfiable, where p is a new propositional variable. 

Disjunctions are replaced by conjunctions of im- 
plications from contexts (propositional formulae) 
to the base constraints (ie. <p\ and fa). The na- 
ture of the base constraints is irrelevant as long 
as there is a satisfaction algorithm for them. The 
key insight is that solving disjunctions of the base 
constraints is no longer necessary since they are 
purely conjunctive. 

Maxwell and Kaplan's goal in doing this was 
to have an efficient method for solving disjunctive 
constraints. The goal in this paper is compiling 
disjunctive constraints into more efficient ones for 
future solution. To this end a somewhat different 
notion of contexted constraint will be used as show 
in lemma ^. 

Lemma 2 (Alternative- Case Form) 

4>i V (j>2 is satisfiable iff {a\ — > pi) A (02 — > §2) A 
(ai V 0,2) is satisfiable, where a\ and 0,2 are new 
propositional variables. 

We can see that this formulation is nearly equiva- 
lent to Maxwell and Kaplan's by substituting p 
for a\ and p for 02- To make the formulation 
completely equivalent, we would need to enforce 
the uniqueness of a solution by conjoining ai V02. 
However, this is unnecessary since we want to per- 
mit both solutions to be simultaneously true. The 
reason for using the modified version of contexted 
constraints in lemma ^| is that we can separate the 
representation of disjunctions into a conjunction 
of the values that the disjuncts can have, called 
the alternatives, and the way in which the we can 
choose the values, called the cases. The alterna- 
tives are the conjunction (ai — > 4>i) A (02 — > P2) 
and the cases are the disjunction (ai Vfl2). 

While we could use repeated applications of 
lemma |^ to turn a disjunction of n disjuncts into 
an alternative-case form, it will simplify the expo- 
sition to have a more general way of doing this, as 
shown in lemma [| 

Lemma 3 (N-ary Alternative-Case Form) 

Y is satisfiable iff /\(ai — » (pi) A W a i i s sa ^~ 

isfiable, where each ai is a new propositional vari- 
able^ 



4 These lex ical rules are simplified versions of those 
presented in flPollard and Sag, 1994[ ). 



5 For a proof see (Maxwell and Kaplan, 1989). 
6 \J </>i an d A Pi are disjunctions and conjunctions 

iGiV iSJV 

of formulae (pi, respectively, where each i is a member 
of the set of indices, N. 



Here A(<ii — ► (pi) are the alternatives and y dj 

are the cases. So for example, 0i V 02 V 03 V 04 
is satisfiable just in case (a\ — > 0i) A (02 — » 02 ) A 
(03 — > 03) A (04 — > 04) A (ai V02 V 03 V 04) is 
satisfiable. 

4 Dependent disjunctions as 
contexted constraints 

The usefulness of the alternative-case form only 
becomes apparent when considering dependent 
disjunctions. Dependent disjunctions can be rep- 
resented by alternative-case forms as shown in def- 
inition [l] below. 

Definition 1 (Dependency Group) 

A dependency group is a conjunction of dependent 
disjunctions with the same name, d, where each 
disjunction is an alternative- case form such that 
there is one alternative for every disjunct of ev- 
ery disjunction in the group, and there is one case 
for each disjunct in the group which is a conjunc- 
tion of the alternative variables for that disjunct 
in every disjunction: 

- A A : < 'v V A 4 

ieM jeN jeN ieM 

where each a* is a new propositional variable and 
AT = {l,2,...,7i}. 

So the dependent disjunction (d 0, 0, 0') A 
(d ip, ip', ip') is the alternative-case form with al- 
ternatives (a\ — > 0) A [a\ — > 0) A (a\ — > 0') A 
(a\ —> ip) A (&2 — > ip') A (a| — > -0') and cases 
((a{ Aa\) y{a\ Aa%) V {a\ Aa§)). The cases enforce 
that the corresponding disjuncts of every disjunct 
in the group must be simultaneously satisfiable. 

We can now start to see where redundancy in 
dependent disjunctions originates. Because every 
disjunction in a group of dependent disjunctions 
must have the same number of disjuncts, some of 
those disjuncts may appear more than once. In 
the above example for instance, occurs twice in 
the first disjunction and ip' occurs twice in the 
second disjunction. To resolve this problem we 
impose the following condition, called alternative 
compactness: if a base constraint 0*- equals an- 
other base constraint from the same disjunction, 
4>\ , then the alternatives variables associated with 
those base constraints, a] and a\, are also equalj^j 
Doing this allows us to express the alternatives 
from the example above as (a\ — > 0) A — > 
0') A [a\ — » ip) A {a\ — > ip'), and the cases as 

7 Note that this requires being able to determine 
equality of the base constraints. 



({a\ A a\) V {a\ A a\) V {a\ A o§)).@ One advan- 
tage of this is that the number of base constraints 
that must be checked during satisfaction can po- 
tentially be exponentially reduced. 

The next section will show how an alternative- 
case form for a group of dependent disjunctions 
can be split into a conjunction of two (or more) 
equivalent forms, thereby (potentially) exponen- 
tially reducing the number of alternative variable 
interactions that must be checked during satisfac- 
tion. 

5 Modularization 

Consider again the example from section |^: 
(d 0, 0, 0', 0') A (d ip, ip', ip, ip')- Represented as a 
compact alternative-case form, the alternatives 
becomes: (a\ — > <p)A(al — > 0')A(a| — > i/0M a 2 ~^ 
■ip'), with cases: ((a\ A a\) V (a\ A a|) V (a| A 
af) V (0.3 A a^)). The key to determining that 
the two disjunctions can be split into different 
groups then involves determining that cases can 
be split into a conjunction of two smaller cases 
(a\ V a\) A (a\ V a|). If the cases can be split in 
this manner, we say the cases (and by extension 
the group of dependent disjunctions) are indepen- 
dent. 

Definition 2 (Independence) 

A case form is independent iff it is equivalent to 
the conjunction of 2 (or more) other cases forms: 

V A4= V A4 A V A 4 

jeN ieM jeN' ieM' jeN" ieM" 
where M' and M" partition M . 

So in the above example, M = {1, 2} where 1 rep- 
resents the first disjunction and 2 represents the 
second. That makes M' = {1} and M" = {2}. 
While M' and M" are derived from M, the ele- 
ments of the Ns are arbitrary. But a consequence 
of definition | is that \N\ = |JV'| x \N"\. This 
will be proved in section The size of the iVs, 
however, represent the number of cases. So for 
instance in the above example, TV might equal 
{1,2,3,4} since there are four disjuncts in the 
original case form, while N' might equal {1,2} 
and AT", {1,2}, since the smaller case forms each 
contain two disjuncts. 

The process of splitting a group of dependent 
disjunctions into smaller groups is called modu- 
larization. Modularizing a group of dependent 
disjunctions amounts to finding a conjunction of 
case forms that is equivalent to the original case 

8 In this example, equivalent alternative variables 
have been replaced by representatives of their equiva- 
lence class. So a\ has been replaced by a\ and a| has 
been replaced by a\. 



form. The modularization algorithm consists of 
two main steps. The first is to take the original 
case form and to construct a pair of possibly in- 
dependent case forms from it. The second step is 
to check if these case forms are actually indepen- 
dent from each other with respect to the original 
one. The modularization algorithm performs both 
of these steps repeatedly until either a pair of in- 
dependent case forms is found or until all possi- 
ble pairs have been checked. If the later, then we 
know that the original dependent disjunction is al- 
ready modular. If on the other hand we can split 
the case forms into a pair of smaller, independent 
case forms, then we can again try to modularize 
each of those, until all groups are modular. 

To construct a pair of potentially independent 
case forms, we first need to partition the set of 
alternative variables from the original case form 
into two sets. The first subset contains all of and 
only the variables corresponding to some subset 
of the original disjunctions and the second subset 
of variables is the complement of the first, corre- 
sponding to all of and only the other disjunctions. 
From these subsets of variables, we construct two 
new case forms from the original using the opera- 
tion of confinement, defined below. 

Definition 3 (Confinement) 

conf(\/ /\ a),M>) 

is the confinement of y A a* with respect to a 
set of indices, M' , 

iff conf( \/ /\ a}, M') = dnf{ \J f\ a) ), 

j£N ieM j£N ieM' 

where M' CM. 

Constructing the confinement of a case form is 
essentially just throwing out all of the alternative 
variables that are not in M' . However, only doing 
this might leave us with duplicate disjuncts, so 
converting the result to DNF removes any such 
duplicates. 

To make the definition of confinement clearer, 
consider the following conjunction of dependent 
disjunctions: 

(d <j>, 0, <t>, 0, <t>' A') A (d ip, ip', ip, ip', ip, ip')A 

(dx,x,x',x',x',x')- 

This is equivalent to the compact alternative 
form:^ 

(a\ —></>) A (4 -> 0') A (a\ -> tJj)A 
(a 2 2 -» V') A (a? -» X ) A (4 - x'), 
and the following case form: case = 

9 In this example, equivalent alternative variables 
have again been replaced by representatives of their 
equivalence class. So for instance, a\, a\ and a\ are 
all represented by a\. 



((a\ A a? A a\) V (a\ Ao^A a\) V {a\ A a? A a|)V 
(a\ A a\ A af) V (a\ A a\ A a\) V (a\ A a| A af)). 
Now we can compute the confinements. For in- 
stance, 

conf(case, {I, 2}) = dnf((a{ A a\) V (a\ A a|)V 
(a\ A a\) V {a\ A a%) V {a\ A a\) V {a\ A a|)). 
After removing duplicates we get: 
conf(case, {1, 2}) = 

{{a\ A a\) V {a\ A a\) V [a\ A af) V (al A af)). 
Likewise, for the complement of M' with respect 
to M , we get: 

conf{case, {3}) = {{a\) V (4)). 

Now we just need to test whether two confined 
case forms are independent with respect to the 
original. This is done with the free combination 
operation, shown in definition |J. 

Definition 4 (Free Combination <S>) 

The free combination of two case forms is the dis- 
junctive normal form of their conjunction: 
case' ® case" = dnf(case' A case") 

The two case forms, case' and case" , are DNF for- 
mulae. To compute the free combination, we con- 
join them and convert the result back into DNF. 
They are independence if their free combination 
is equal to the original case form, case. 

For example, the free combination of the two 
confinements from above, 

{{a{ A a\) V (a{ A a|) V {a\ A a\) V (a\ A a|)) and 

((«?)V(aD) 

is 

((a{ Aoj A af) V (aj Aa 2 2 A a\) V {a\ Aa\ A a?)V 
(a\ Aa\A a\) V {a\ Aa\ A 4) V (a{ A a\ A a|)V 
{a\Aa\Aa\)\J {a\Aa\Aa%)) 
which is not equal to the original case form: 
{{a\ Aa\ A a\) V (a\ Aa 2 2 A a\) V {a\ Aa\ A a\)y 
(a\ A a 2 , A af) V [a\ Aa\ A af) V (a| A a\ A af)), 
so the first two disjunctions are not indepen- 
dent from the third. However, the second dis- 
junction is independent from the first and the 
third since conficase, {2}) = ((af) V (a?,)), and 
conf{case, {1, 3}) = ((a}Aa?)V(a}Aa|)V(4Aa|)), 
and their free combination is equal to the origi- 
nal case form. Therefore, the original formula is 
equivalent to ( d > ^,V')A(d» 0»') A (rf" X,x',x')- 

6 Free combination elimination 

The last section showed an effective algorithm for 
modularizing groups of dependent disjunctions. 
However, even though this is a compile time al- 
gorithm we should be concerned about its effi- 
ciency since it has exponential complexity. The 
main source of complexity is that we might have to 
check every pair of subsets of disjunctions from the 



group. In the worst case this is unavoidable (al- 
though we do not expect natural language gram- 
mars to exhibit such behavior). Other sources of 
complexity are computing the free combination 
and testing the result against the original case 
form. Luckily it is possible to avoid both of these 
operations. This can be done by noting that both 
the original case form and each of the confined 
case forms are in DNF. Therefore it is a nec- 
essary condition that if the free combination of 
the confinements is the same as the original case 
form then the product of the number of disjuncts 
in each confinement, |case'| x |case"|, must equal 
the number of disjuncts in the original case form, 
|case|. Moreover, since both confinements arc de- 
rived from the original case form, it is also a suf- 
ficient condition. This is shown more formally in 
theorem [j]. 

Theorem 1 (Free combination elimination) 

case = case 1 £g> case" \case\ = \case'\ x |case"| 

Proof ==> We assume that case' <g> case" = case. 
Since both case' £g> case" and case are in DNF and 
DNF is unique, we know that \case' ® case"\ = 
\case\. We also know that case' and case" have 
no disjuncts in common because they have no al- 
ternative variables in common, so \case' ®case"\ = 
\case'\ x | case" | . Therefore, \case\ = \case'\ x 
jcase" | . □ 

Proof <= Again since case' and case" have 
no disjuncts in common, we know that |case' £5 
case"| = |case'| x |case"| and therefore, that 
|case| = \case'®case"\. Every disjunct in case can 
be represented as A' A A" where A' is a disjunct 
in case' and A" is a disjunct in case" . So the dis- 
juncts in case' ® case" must be every conjunction 
of possible A's and A"s. So case' £§) case" must 
contain all of the disjuncts in case and it could 
contain even more, but then |case' (g> case"| > 
|case|. However, since |case| = |case' ® case"|, 
case' (E) case" must contain exactly the disjuncts 
in case and therefore case = case' ® case" . □ 

We can see that this would have helped us in the 
previous example to know that conf (case, {1,2}) 
could not be independent from conf(case, {3}) 
with respect to case because \conf(case, {1, 2})| = 
4 and \conf(case, {3})| = 2 but |case| = 6, not 
8. Conversely, since \conf(case, {1, 3})| = 3 and 
\conf(case, {2})| = 2, we know immediately that 
these case forms are independent. 

This theorem also allows us to perform other 
combinatorial short cuts, such as noting that if 
the number of disjuncts in the original case form 
is prime then it is already modular. 



7 Conclusion 

This paper has presented an efficient form for 
representing dependent disjunctions and an algo- 
rithm for detecting and eliminating redundant in- 
teractions within a group of dependent disjunc- 
tions. This method should be useful for any sys- 
tem which employs dependent disjunctions since 
it can eliminate exponential amounts of processing 
during constraint satisfaction. 
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